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Abstract 



We calculate the standard deviation of {N\ — No), the difference of the 
number of agents choosing between the two alternatives of the minority game. 
Our approach is based on two approximations: we use the whole set of possible 
strategies, rather than only those distributed between the agents involved in a 
game; moreover, we assume that a period-two dynamics discussed by previous 
authors is appropriate within the range of validity of our work. With these 
appproximations we introduce a set of states of the system, and are able to 
replace time averages by ensemble averages over these states. Our results 
show a very good agreement with simulations results for most part of the 
informationally efficient phase. 

PACS Numbers: 05.65.+b, 02.50.Le, 87.23.Ge 

I. INTRODUCTION 

The minority game (MG) is an adaptive game introduced by Challet and Zhang JlJ to 
study competitive systems whose available resources are finite (ecology, financial systems, 
traffic in Internet, etc). 

At every step of the game, the A" participants (agents) must choose one of two alternatives 
(0 or 1, to buy or to sell, to choose one of two possible routes, etc.), and the winners will be 
those who turn out to be in the minority. Each agent decides her move based on the history 
of the game (a string of m bits with the information about the m previous minority sides), 
using one of her set of s prescriptions or strategies. Feedback is established by a reward 
system whereby every winner agent gets a point; moreover, out of all the strategies in the 
game, those that correctly predicted the minority side also get a (so-called virtual) point. 
The game is adaptive, because an agent will make her choise using the strategy that at this 
particular time has more points. 
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For every time step, let us call No (Ni) the number of agents choosing side (1), such 
that Nq + Ni = N. The main variable that is usually considered is a, the standard deviation 
of the difference (Ni — N Q ), as a function of N, m, and s, 



where T is the number of steps of the game. 

The standard deviation a is a measure of the form in which the resources (points) of 
the population are used: whenever \Ni — N \ is small, more agents get a point, implying 
a better utilization of the resources. The model has attracted some attention because for 
certain values of m, s and N, a turns out to be smaller than o~ r , the standard deviation of 
a random-choise game, where the N agents choose sides randomly. This result implies that 
althought there is no agent-agent interaction in the MG, there is some kind of emergent 
coordination; in other words, there is an effective interaction that appears through global 
magnitudes of the system. This collective behavior motivated a diversity of studies of the 
MG, with a variety of techniques: numerical simulations [TJ, 0; mean- field approximations 
0; equivalence with spin-glass models ||; thermal treatments [[J, etc. Manuca et al. || 
have studied the complete succession of minority sides, and used a specific mean-field to 
calculate a. In the present work we calculate a for the same set of values of N, m and s 
used by these authors, obtaining a somewhat better agreement with numerical results. 

The public information on which strategies are based is the record of the last m minority 
sides (that is why m is called the memory of the game). Then, there are 7i = 2 m different 
histories, the history at time t being the string k = (xt-m, ••■> Xt-i) where Xt is the minority 
side (0 or 1) which results after time t. In the following we will write k as an integer between 
and 2 m — 1. Every strategy is a function that assigns outputs (predictions of the following 
minority side) for every one of the Ji possible inputs . Hence, there are £ = 2 n different 
strategies in a game of memory m ; at the beginning of the game each agent is randomly 
assigned s strategies, from the pool of £ (with replacement). 

It is known || that data for o 2 /N displays scaling for every value of s when plotted as a 
function of z = 2 m /N. In fact the plot of o~ 2 /N vs z has two different regions, as it is shown 
in Fig. [I] : (i) z < 0.5, which is an informationally efficient phase. Here a 2 /N is a decreasing 
function of z, showing a change of behavior around a 2 /N pa 1; (ii) z > 0.5, where a 2 /N 
increases and asymptotically approaches the line a 2 /N = 1. This line corresponds to the 
random-choise game. Our calculation is appropriate within a good part of the first region. 

Usually, numerical simulations are carried over for a game of T time steps, and afterwards 
the results are averaged over several independent runs of the game. In every run there are 
£ = sN strategies distributed between the agents, out of the pool £. There are two points 
worth noticing: the pool £ of all the strategies is symmetrical , in the sense that for every 
possible history, the number of strategies predicting % t = 1 is equal to the number of 
strategies predicting xt = 0; on the other hand, every set of £ strategies used in each run is 
only approximately symmetrical, because it is just a finite sample of £. When the sample is 
only of a moderate size, however, the difference between the properties of the sample and 
the pool is small, and the corresponding uncertainty is known. Indeed, this is widely used 
in standard 'sampling' techniques 0: for instance, the uncertainty associated with samples 
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of size ~ 400 is less than « 5%. These considerations are at the base of our calculation, as 
we now explain. 

In the following we will make analytic calculations over the whole pool C, rather than 
over samples of size £. In this form we will benefit from its symmetry, and also will be able 
to calculate several magnitudes by simply counting different sets of pairs of strategies (for 
s — 2). In this sense we are using sampling techniques the other way around: we start from 
the pool, to find the samples' properties. As it will be clear below, this is a mean-field-like 
approach. 

Equation (JTJ) calculates a as a time average over T time steps. We will replace this by 
an ensamble average over a (restricted) set of states of the system. In Sec. II we introduce 
the states and describe how a period-two dynamics observed in Ref. |§ can be used to 
substantially reduce the number of states needed. In Sec. Ill we write down a in terms of 
the new variables; to carry out the actual calculation of a we found it useful to make use of 
some ad-hoc diagrams that are also explained. In the final section we offer some conclusions. 

II. THE SPACE OF AVAILABLE STATES 

In this work we will only consider games where s = 2, i.e. every agent has two strategies 
to choose from. To find a from Eq. ([I]) we need to write down expressions for Ni/N and 
Nq/N. It is clear that the values of these ratios will fluctuate with the actual numerical 
realization. We now assume that we can obtain approximate values for these variables by 
using not N, the actual number of agents, but rather a game with A/", the maximum amount 
of (virtual) agents that can originate in the pool. Af is equal to („) +C, the number of pairs 
of strategies that it is possible to make from the pool C (with replacement) [] . Within this 
approximation, we will calculate Ni/N as ~ Af\/N , and N Q /N « Af /Af, where A/o(A/i) is 
the number of virtual agents choosing side (1). 

We now describe the states to be used in our ensemble average. It is convenient to 
begin by introducing microstates of the system. For each time step t, we define a(k') = 
rii(k') — no(k'), the accumulated difference between the number of times that after the 
appearance of string k', the resulting minority side was 1 (n 1 (A; / )) or (n (k')). A microstate 
H is given by (7?, k ), where ~~ct = (a(0), a(l), a{7i — 1)) has the information of the net 
amount of virtual points assigned to the pool of strategies, and k is the string containing 
the history that effectively showed up at time t. 

The set of microstates is rather complex and very big. In order to get simpler expressions, 
we now turn back to the other approximation in which this work is based on, the period-two 
dynamics observed in [Q. Let us consider a game with s = 2, m = 2; in this case there 
are only four strings k : 00, 01, 10 and 11, representing all possible results for the last two 
minority groups. In their study of the time series of minority groups, Manuca et. al. kept 
a record of all the times when a given string k, for instance k = 01, appeared. This record 
shows a most remarkable behavior: odd occurrences of k are followed by a minority side x 



Notice that this excludes the situation where two agents get the same pair. In any case, the 
probability that this happens in the game is vanishingly small even for a moderate value of m. 



3 



whose value is essentially random (e.g. Xodd = 1), while for the next (even) occurrence of 
k the result is deterministic, being the opposite of Xodd (Xeven = in this example). This 
behavior was described as a period-two dynamics (PTD) by those authors, and can be seen 
to be essentially true for m = 2. In fact, we found that a plot of the probability that the 
data follows this rule, as a function of z — 2 m /N, displays scaling as can be seen in FigfJ, 
where one can also see that this probability is greater than 0.5 if z < 0.7 . 

The simultaneous use of the pool £ and PTD allows a dramatic simplification of the 
ensemble of states. To understand this point, let us consider one step t of the game where 
a minority string k is followed by a minority side Xu as mentioned above, C/2 strategies 
predict this output, and therefore get a point. Now, the next time that the string k appears, 
because of PTD, the other half of the strategies (predicting the opposite minority side) 
should get a point. Therefore, after an even appearance of k, all strategies will have one 
extra point. But remember that these points are assigned so that one can choose the more 
succesfull strategies, simply by picking up that with the greatest amount of points; in this 
sense, nothing should change if, rather than adding a point to the second half of strategies, 
we remove the point we already assigned to the first part. The implication of this procedure 
is great: we need to consider only those microstates where \a(k)\ < 1. In the following, we 
will always refer to this subset of microstates, and the corresponding states. 

A state is defined as the set of microstates that have the same value of the variables 
m, n, 4> and p, where n = J2{k'} | a (^')| > = l a (^o)| , an d p = a(k ). As mentioned above, <p 
can take the values or 1, while p can be equal to or {1, -1}, respectively; {k } is the set 
of all values of k . It is of some help to think of (<p,p) as an spin and its projection. In the 
future we will need to write down the number of microstates of the set, i.e. the degeneracy 
of the state, g(n,m,<f),p). 

In FigfJ we show a diagramatic representation of the microstates, that is also helpful to 
visualize the states. Each diagram has TC rows, one for every possible history of m bits; the 
first column shows the value of ~ct (the values 1,-1 and are represented by an up or down 
arrow, and an empty site, respectively), and the second column simply displays the actual 
string k. There can only be one arrow for row, and up to 7i arrows in the diagram. Hence, 
the maximun number of points that the pool L can have is 2 n ^ 1 7i. | 



2 In the recent past there has been some discussion about the relevance of the memory jjj in the 
MG. By randomly generating strings of minority sides, Cavagna obtained results essentially equal 
to those of Fig.||; he concluded that memory is irrelevant in the MG, and should not be used 
to explain its behavior. On the contrary, Savit argued that "the dynamics and the information 
structure in the two versions are fundamentally the same". Our diagrams are particularly useful 
to shed some ligth on this point. To choose a string randomly, simply means to choose in this form 
the row in the rigth column of our diagrams; this eventually will change the 'microscopic' behavior 
of the model, but the register of what happens after the appearance of any history is not changed, 
because the rules that govern the left column of the diagram are not changed. 
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III. THE STANDARD DEVIATION 



We now have completed the characterization of the states, and can come back to the 
calculation of (Mi — Mq)/M. As our approach is based on all the pairs of strategies (i.e. 
virtual agents) that can be formed out of the pool , there are three rather natural magnitudes 
to consider: Mdi ( Mdo) the number of agents whose response to a given microstate will 
be to choose x — 1 (x — 0) with certainty, hence the nickname 'decided' agents ; and 
M u , those agents that under the same circumstance can not make up their minds, thus 
the nickname 'undecided'. Clearly, M = Mdi + A^o + M u . In fact, after establishing a 
method to know how M u will split into those choosing (M u q) or 1 (M u i), we can write 

Mi - M = Mdi + M ul - M d0 - Muo- 

Recalling that every possible pair of strategies corresponds to one virtual agent, to obtain 
the M's is convenient to know Ef(~ct,k) and £q(1?, A;), the number of strategies with x 
virtual points predicting \ = 1 and \ — 0, respectively, as the following minority side after 
the string k. Using Ef and Efi as a shortand notation for Ef(~ct, k) and E^Ht, k), it is 

M u = J2 E i E o 

X 

^ = EE E * E i> + \ E E E * E i + E (^) 2 

X j<X X j^x x 

= E E E i E o + \ E E E o E o + E ( E o ) 2 ( 2 ) 

x j<x x jj^x x 

The last two terms of Mdi and Mdo are equal to C^ 2 +£/2 (where C*, = ), and are 
related with those agents for which both of their strategies predict the same side for the 
string k. As we are interested in the difference (Mi — Mo), these factors will cancel out, and 
don't need to be considered in the following. 

Microstates /ii = (~cti, ki), \ii = (it 2, k 2 ) obtained one from the other by interchanging 
one or more rows in the diagrams of Fig.(|3]) are symmetrical, in the sense that they have 
the same values Ef: Ef(n 2 ) = Ef(/jLi) , (i = or l).This, in turn, implies M u (l^2) = M u ((i>i), 
and similarly for all terms of Eq.( 0). In other words, the values of these magnitudes depend 
only on (m,n,(f),p), i.e are state dependent. Therefore, we will only need to consider the 
E's for different states, together with the corresponding degeneracies g(m,n,4>,p). Thus, 
the expression for the standard deviation becomes 

°" 2 = JP n 9 ( m > Win + Mui - M d0 - M u0 ] 2 (3) 

{m,n,(f>,p} 

where Q = J2{ m ,n,<p, P } #K n,<f>,p). 

For every state, we assume that each undecided agent chooses randomly between x — 
and x — 1- Moreover, a 2 will be averaged over a certain number of independent runs; hence, 
we only need the average values of M u i and A/^ 1; that are given by the standard expressions 
of the random walk, (M u i) = \M U and (M 2 ±) = \M U + \M 2 . 

Then 
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N 2 1 

{m,n,4>,p} 

The calculation of the different terms of Eq.(Q), and the corresponding calculation of o 
(Eq.(^)) is straigthforward, but rather lengthy and cumbersome. We have displaced to an 
Appendix some of the details; on the other hand it is useful to use a graphic representation 
to find Eq, Ef , and show how to obtain these terms. Notice that, regardless of which 
microstate we are looking at, the set of all possible strategies will always split into two 
groups of C/2 strategies each, predicting x = or 1 respectively; therefore, we just have 
to find how each one of these groups will split into smaller subgroups characterized by the 
number of points. Let us consider, to be specific, the case m = 2. In this case, we only need 
to know the E's for (n = to 3, — 0, p = 0), and for (n — 1 to 4, — 1, p — ±1). In 
Fig. (|) we show these values for the case — 1, p — — 1, for strategies predicting x — 
as the following minority side. Being 0=1 implies that we have to consider microstates 
having a(k ) ^ 0. The simplest form to find the £"s in this case, is by using the diagrams 
with one to four arrows. Thus, it is clear that there are C/2 strategies with one point [and 
predicting x = 0). With \~ct\ = 2 (i.e. n = 2), there will be C/4 strategies predicting only 
one of the minority sides (having one point), and C/4 predicting both minorities correctly 
(thus having two points); analogously, for n = 3 there will be three groups of C/8, C/4, and 
C/8 strategies, with 1, 2 and 3 points, respectively; finally, for n = 4 there are four groups 
with C/16, C(l + 2)/16, C{1 + 2)/16 and £/16, with 1, 2, 3 and 4 points respectively. 
Graphically, this gives rise to a tree, as that shown in Fig.|], in which every row corresponds 
to a given value of n, and in every site of the tree one writes E^. In fact, introducing 
by writing Ef{m, n, <f>,p) = 2 n ~ n+ ^~ 1 ef(m, n, 4>,p), with i — 1 or 0, it is possible to see that 
the tree associated with e x { turns out to be Tartaglia's triangle, whose analytic properties 
are well known. The cases including states where in = <p = 0) can be handled in the same 
form. The difference between states with = and = 1 it is apparent in the schemes 
(a) and (c) of Fig|| but from the point of view of the trees, it simply implies that when 
= the values of n are 'shifted'; thus the 'upper' row has n = 0, and the 'lowest' one has 
n — Ti. — 1; moreover in row n, say, the different terms will have 0,1,..., n points. All terms 
needed to calculate a are simply related with different sums and products of terms of these 
trees, without mixing factors from different rows. 

To carry out the calculation of a 2 we use Af u , \J\fdi — Mda\ and g, as obtained in the 
Appendix. The actual expressions for J\f u and \Afdi — Ndo\ are functions of the value of 0, 
namely 

A^=(2^— 1 ) 2 C n 2n 
AT d i-Ar do = (5) 



for = 0, and 



\M dl 

for 0=1. 





— U 2n- 


_ 2 if n > 1 


M u 


= 


if n = 1 


A/"do| 


— °2n- 


-2 ' 2n-2 



(6) 
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The state degeneracy is Q 

g(m, n, 0, p) = C n n \U -n + </>) (7) 
where < n < H+(f> — 1 

Two points are worth mentioning, in relation with a 2 . First, using Eq.( [A2|) in Eq.(||), we 
verify that a 2 can be written just in terms of Af u 's factors. Furthermore, this expression 
can be factored out as follows: a 2 = N jF(m). Notice that our general assumptions require 
to consider N big enough, so that the replacement of the samples by the whole pool makes 
sense; hence, the factorization should also be valid in the same limit. 

Our results are presented in Fig|| We have done numerical simulations of the MG for 
N from 101 up to 1001, 50000 time steps, and m = 2 up to 13; the results where averaged 
over 32 independent runs. We show both the usual simulation data, as well as the results 
of the calculation with the equations derived in this work. As it can be seen, if j? > 1 (and 
2 m /N < 0.1) there is an excellent agreement between these two sets of data. For 2 m /N 
between 0.1 and 0.7 we only have a qualitative agreement, and for 2 m /N > 0.7 our results 
(not shown) are clearly inadequate. This coincides with the range of validity of the results of 
Manuca et al. ||, i.e we only cover the informationally efficient phase. It is to be noticed 
that in the first region our results can be fitted to a high degree of accuracy by a straight 
line with slope -1. 

IV. CONCLUSIONS 

We have made a mean-field-like calculation of a, the standard deviation of the MG. Our 
approach is based on two approximations: on the one hand, rather than using the properties 
of the actual samples used to calculate, we consider the whole pool of strategies, £; in this 
form we benefit from the symmetry of the histories. It should be noticed that we actually 
quote results that are averages over several independent runs, so that it is reasonable to 
expect that they should be near to the symmetry of the pool. On the other hand, we 
considered that the period-two dynamics introduced by Manuca et.al || is correct within 
the range of validity of our work. We used these approximations to replace the time averages 
appearing in Eq.fll]), by ensemble averages over the states. 

Our results show an excellent agreement with data from the simulation in the region 
a 2 /N > 1. The calculation embraces an important part of the informationally efficient 
region, but the agreement is lost when a 2 /N is near of its minimun value. 

This method can also be useful to deal with cases where s = 3 (one needs to consider all 
the sets of three strategies) or greater. 

H.C. was partially supported by EC grant ARG/b7-30 11/94/27, Contract 931005 AR. 



3 in fact this expression is totally correct in the case where the microstates have a uniform proba- 
bility distribution; in a game with memory the succession of strings is obviously concatenated and 
Eq.(fJ) is only approximated 
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APPENDIX: 



We provide some extra details about the calculation of Af u , \Ndi — N"do\ an d 9- 

The values of ef , (for i = 0, 1), that we have already identified with the coeficients in 

Tartaglia's triangle, are state dependant (remember that in site (q,w), the coeficient of the 

tree is the combinatorial = ( w ) . 

eg(m,n, 0,0) = e?(m,n,0,0) = (Ala) 
eg(m, n, 1, 1) = ef (m, n, 1, -1) = C*_ x (Alb) 

eg(m, n, 1, -1) = ef(m, n, 1, 1) = C* n Z\ (Ale) 

where 0<x<n, 0<x<n — 1, and 1 < x < n in Eq.( |Al| a-c), respectively. In the first 
equation = and therefore < n < Ji — 1, while the last two equations correspond to 
states with = 1, then 1 < n < 7i. 

We begin with = states, i.e. (m, n, 0, 0) states, for which, following Eq.(Q) 



2=0 21=0 

Moreover, for the same states 

n n n n 

■Mil = 2 2 ^— i) J] ^ efeg = 2 2 ^— « ^ 53 ClCi = Af d0 

x=0 j=Q x=Q j=0 

Therefore, \J\fdi — M20I = 0. 

Let us now consider = l,p = 1 states, i.e. (m, n, 1, 1) states 



n— 1 n— 1 

A/* f)2(H~n) \ ^ x x fy2(H—n) \ /^yx /^yx— 1 /-m 

yVn _ ^, e l e — Z / .Vrn-l ~ °2n-2 



x=l x=l 



for n > 1. If n = 1, it is M n = 0. 
Mil and A/^o are given by 



n— 1 n 

Mil = 2 2 ^-) 53 etei = 2 2 ^) 5] £ C^C^ 
j<2 j=o s=i+i 

n— 2 n— 1 

M rf0 = 2 2 ^)53e^ = 2 2 ^)5] J2 C n-\CU 

x<j x=l j=x+l 



Finally, 



S 



n—1 n—1 
Wdl - A/do| = + ^n-l^nil = ^2(n-l) + ^2n-2 

i=o i=o 

More explicitely, 

(m,n, -M d0 (m,n, l,p)\ = J\f u (m,n- 1, 0, 0) + Af u {m, n, l,p) (A2) 

It is remarkable that the difference \J\fdi — Ado| can be written just in terms of Af u . 

Notice that this 'nasty' collection of coefficients can be identified with combinations of 
elements of the trees. The simplest case is M u in Eq.(|2]); in this case both trees (for x = or 
1) are equal, and M u is given by the sum of the squares of the factors in row n of the tree. 

To calculate the degeneracies g(m,n,(j),p), one has to count the number of microstates 
sharing the same values of (m,n,(f),p), using the diagramatic representation of the states. 
For = 0, 

g(m,n,0,0) = C% (H - n) T (A3) 

where < n < T~L — 1. The first factor counts the number of ways that n arrows can be 
distributed into Ti places, the second is the number of strings available for being the actual 
history (x in the diagrams), and the third takes into account that each arrow can be up or 
down. 

Analogously, we can write g for the case = 1 

g(m,n,l,±l) = C£ n T~ x (A4) 

with \ <n<U. 

It is possible to write these expressions in a form valid for both values of 0, as in Eq. ([?]). 
Finally, the sum over all the degeneracies is the total amount of microstates, Q(m) = 
E {m ,„ Ap} ^Kn,0,p) = 3 2m 2 m . 
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FIGURES 



FIG. 1. Scaling of a 2 /N vs z = 2 m /N, for different values of m and N: UN = 101; o N = 201; 
A N = 301; »N = 401; y iV = 501; x N = 1001. For each value of N and m there are 32 runs, 
each one of T = 10000 time steps; for N = 1001, each run has 50000 time steps. 

FIG. 2. Validity of the period-two dynamics. Number of times in which the dynamics comes 
true, over the total number of times in which the corresponding history occurs for an even time. 
Data are for different values of N: UN = 101; o N = 201; A N = 301; mN = 401; y N = 501; x 
N = 1001. For each value of N and m there are 32 runs, each one of T = 10000 time steps; for 
N = 1001, each run has 50000 time steps. 

FIG. 3. Diagrams used to represent the ensamble of microstates for m = 2. ^From top to 
bottom, the rows correspond to the strings 00, 01, 10 and 11. The symbol x is used to indicate 
the actual history; the arrows are used to represent the virtual points assigned to the strategies. 
For every history with an arrow, there are C/2 points assigned, a: (2,2,0,0); b : (2,2,1,1); c: 
(2, 2, 1, 1); d: (2, 4, 1, —1). Notice that b and c are two microstates corresponding to the same state. 

FIG. 4. Array of strategies corresponding to states with m = 2, cf) = 1, and p = —1. In this tree 
there are represented the C/2 = 8 strategies predicting % = for the actual string. Each row of 
the tree is characterized by a value of n, beginning with n = 1, up to n = 4. In each site of the tree 
the number between brackets gives the amount of virtual points (x = 1, .., n) for the corresponding 
strategies. There is an analogous tree for the strategies predicting x = 1 for the same string. The 
difference of both trees is given only for the virtual points assigned; in the tree for \ = 1 these 
values are displaced, so that each row begins with x = 0, and ends up with x = n — 1. 

FIG. 5. Data of a 2 /N vs z = 2 m /N for N = 1001, both for the numerical simulations and from 
our calculations. As before, the results are averaged over 32 runs of 50000 time steps each. 
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